Describing intonation with a parametric model
نویسنده
چکیده
In this study a data-based approach to intonation modeling is presented. The model incorporates knowledge from intonation theories like the expected types of F 0 movements and syllable anchoring. The knowledge is integrated into the model using an appropriate approximation function for F 0 parametrization. The F 0 parameters that result from the parametrization are predicted from a set of features using neural nets. The quality of the generated contours is assessed by means of numerical measures and perception tests. They show that the basic hypotheses about intonation description and modeling are in principle correct and that they have the potential to be successfully applied to speech synthesis. We argue for a clear interface with a linguistic description (using pitch-accent and boundary labels as input) and discourse structure (using pitch-range normalized F 0 parameters), even though current text-to-speech systems usually still do not have the capability to predict most of the appropriate information.
منابع مشابه
Describing the development of intonational categories using a target-oriented parametric approach
In this paper we analyze the relation between adults’ intonational categories as described in the ToBI framework and children’s intonation contours, using a parametric approach and cluster evaluation methods. In the field of prosody, an increasing number of studies on the development of intonation apply the intonational categories of adult speech described as a sequence of high (H) and low (L) ...
متن کاملComparing two different principles of parametric F0 modeling
A number of data-based approaches to intonation modeling represent F0 movements using continuous parameters. This is contradictory to most intonation theories, which suggest that intonation can be modeled with a set of distinct phonological entities that are phonetically realized as F0 movements. This principle has rarely been incorporated into data-based intonation modeling. In this study we c...
متن کاملThe Copasul Intonation Model
A new data-driven and linguistically interpretable intonation model for the automatic analysis and synthesis of fundamental frequency contours is introduced: the CoPaSul model, which provides a contour-based (Co), parametric (Pa), and superpositional (Sul) intonation representation. Its application in F0 analysis and generation is described as well as its linguistic anchoring with respect to se...
متن کاملPersonality prediction based on intonation stylization
This study’s aim is to predict speaker personality from intonation patterns in spoken dialogs. Intonation patterns were extracted by a parametric superpositional stylization approach that allows for pattern description on a parametric as well as on a categorical level. Based on features derived from these representations we trained support vector machines and fitted generalized linear regressio...
متن کاملTotally data-driven intonation prediction model using a novel F0 contour parametric representation
This paper proposes a novel parametric representation of mandarin intonation based on orthogonal polynomial approximation. The polynomial is a simplified representation of Parallel Encoding and Target Approximation (PENTA) intonation model that includes a target component and an approximation component. We also propose predicting the polynomial parameters from linguistic and phonetic attributes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998